Python Program
– To Identify Silent Mutations for the Introduction of Restriction Sites in Open Reading Frames
Project Team
Associate Professor Anuradha Mathrani
School of Mathematical and Computational Sciences
Massey University
Auckland, New Zealand
Email: a.s.mathrani@massey.ac.nz
Web: My Profile
Associate Professor Evelyn Sattlegger
School of Food Technology and Natural Sciences
Maurice Wilkins Centre for Molecular BioDiscovery
Massey University
Palmerston North, New Zealand
Email: e.sattlegger@massey.ac.nz
Web: My profile
Our Project
For many applications in molecular biology, restriction sites need to be engineered into an open reading frame (ORF), a part of the genetic material that codes for a protein. Importantly, silent mutations need to be performed because only these do not alter the amino acid sequence of the protein. However, finding such silent mutations is a very time-consuming process. We have developed a program which recognizes all silent mutations in an open reading frame (ORF) that each lead to a new restriction site. Our program uses python technologies comprising web crawlers and data analysis libraries to deduce the amino acid sequence coded by an ORF and convert the ORF nucleotide sequence into the amino acid single letter sequence. In doing so, reverse translation back into the nucleotide sequence allows the consideration of all possible nucleotide sequences coding for the same amino acid sequence, which are then compared with the restriction recognition sites of commercially available restriction enzymes, such as from New England Biolabs (e.g., https://www.neb.com/). This allows the identification of restriction sites that can be engineered via silent mutations within the provided DNA input sequences.
Download the test application
The downloadable project .ZIP file bundlecontains 8 individual files, which are described below:
- 1. CheckRestrictionEnzyme.py
- This is the source code in Python which user can download and compile themselves
- 2. CheckRestrictionEnzyme.exe
- This is the Executable file on a Microsoft Windows operating system
- 3. TestDNA.txt
- The sample DNA sequence text that can be inputted in the program
- 4. NEBEnzymeList.csv
- This file contains the Enzyme data extracted from New England BioLabs (https://www.nebiolabs.co.nz/tools-and-resources/usage-guidelines/nebuffer-performance-chart-with-restriction-enzymes)
- 5. TemplateEnzymeList.csv
- Template provided to user if they want to upload their own list
- 6. Instructions.doc
- Brief user instructions are provided here
- 7. Detailed Instructions.doc
- Flowchart with detailed instructions